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Abstract 

We prove that the codes issued from the elimination of any sub- 
alphabet in a trace monoid are finite-state recognizable. This implies 
in particular that the transitive fatorizations of the trace monoids are 
recognizable by (boolean) finite-state automata. 

Keywords: Trace monoid; Lazard's elimination; automata with 
multiplicities. 



1 Introduction 

Schiitzenberger ([H] Chapter 5) introduced the notion of a factorization of a 
monoid M 

M = f[M i (1) 
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where (Mj)j g / is a subfamily of submonoids of the given monoid M. When 
M = A* is a free monoid, at the both ends of the chain, one has complete 
factorizations like Lyndon and Hall factorizations and the bisections 
\I\ = 2 0. 

A nice way to produce factorizations is to start with a bisection M = M\Mi 
and refine the factors using a uniform process. Doing this, we could obtain a 
complete factorization for every trace monoid [3] • Trace monoids are defined 
as follows. Consider an alphabet S = {x±, - ■ ■ ,x n } and a commutation 
relation i? (i.e. a reflexive and symmetric relation) on S. The trace monoid 
M(S, i?) is the quotient 

M(E,0) = (2) 

where is the congruence generated by the relators ab = ba where (a, b) € 
•&. 

Later on, we adressed the question of bisecting a trace monoid so that the left 
factor be generated by a subalphabet (Lazard bisection) and the right factor 
be a trace monoid 5 . Doing so, we obtained a complete description of the 
factors and graph-theoretical criteria for the factorization. We conjectured 
that the trace codes so obtained could be recognized by finite-state automata 

Bin this paper, we prove that the answer to the conjecture is positive. 
This will be a consequence of the more general result that if a trace monoid 
M(S,i?) is bisected as 

Af(E,i?) = L.Af(B,i?fl) (3) 

with BcS and $b = $ H (B x B)), then the minimal generating set (3(L) 
of L is recognizable by a finite-state, effectively constructible automaton. 
Here, we prove this fact and give the construction of the automaton. 
The paper is organised as follows: 

In section 2, we recall basic notions related to trace monoids and recog- 
nizability. In section 3, we prove that the left factor of a Lazard bisection 
is a recognizable set and we describe the construction of a deterministic au- 
tomaton recognizing it in section 4. To end with, we explain in section 5 
how to construct a deterministic automaton which recognizes the generating 
set of the left factor of such a bisection. 

2 Trace Monoids 

Trace monoids were introduced by Cartier and Foata with the purpose of 
studying some combinatorial problems linked with rearrangements (see j2]). 
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Next, this notion has been studied by Mazurkiewicz and many schools of 
Computer Sciences in the context of concurrent program schemes (see 

nni). 

Let x € E be a letter and denote Com(x) the set of letters which commute 
with x 

Com(x) = G $}. (4) 

In particular, one has x £ Com(x). Let w £ M(E,$) be a trace, we will 
denote 

TA(w) = {xe T,\w = ux,ue M(E, #)} (5) 

the terminal alphabet of w. 

As it is shown in 3_, Lazard elimination occurs in the context of traces. 
Let fibea subalphabet of E and $b = t?fl(BxB). The trace monoid splits 
into two submonoids 

M(E,d)=L.M(B,d B ) (6) 

where L is the submonoid consisting in the traces whose terminal alphabet is 
a subset of E \ B. Furthermore the decomposition is unique, which suggests 
that the following equality occurs in Z(E,i9) = Z[M(E, •&)], the algebra of 
series corresponding to Z[M(E, $)] [I]. Thus, 

M(E, ■&) = L. M(B,0 B ) (7) 

where S_ denotes the characteristic series of a subset S C M(E, ■&) i.e. 

5=E«Z((S,>J)). (8) 

w£S 

Let 4> be the natural surjection E* — > M(E, i?), the set of the representative 
words of a trace t is defined as Rep(t) = (/> -1 (t). We can extend this defini- 
tion to trace langages Rep(L) = ^ _1 (L) = UteL ^ (*)• trace langage is 
said recognizable if and only if its representative set is, and we say that an 
automaton recognizes L if and only if it recognizes Rep(L). 

Example 1 Let a and b be two commuting letters, then the set Rep({a6}) 
is recognized by the automaton 
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In fact, one can prove that a rational language is a set of representatives (i.e. 
it is saturated w.r.t. the congruence =g) if and only if the corresponding 
minimal automaton shows complete squares as above. 

We will denote Rec(£,#) the set of recognizable sets of traces. 



3 Recognizing the left factor 

The Z-rationality of the left factor L is a direct consequence of the unicity 
of the decomposition, which, in term of formal series, reads 

M(S,fl) = L. M(B,ti B ) - (9) 

where S denotes the Z-characteristic series of the set S (i.e. S = J2xes x )- 
Indeed, by a classical result due to Cartier and Foata ([2] Theorem 2.4) the 
Z-characteristic series of M(S, #) is rational when the alphabet S is finite a 



M(S,0) = ^ 1 —— • (10) 

{ai,...,a„}£Cliques(S) 

where the sum at the denominator is taken over the set Cliques(S) of 
the cliques of S (i.e. commutative sub-alphabets). Hence, one obtains the 

a The formula holds also when the alphabet is infinite but the denominator is then a 



4 



rational equality 



L = 



E (-i) B *i 

{ai,...,a„}eCliques(S) 



E (-l) B 6l-"6r. 
.,b„}eCliques(B) 



(11) 

Nevertheless, this remark is not sufficient to show that L is recognizable as a 
language. Furthermore, for traces, one has the strict inclusion Rec(£,i9) C 
Rat(£,#). To prove that L is recognizable it suffices to find a construction 
of Rep(L) using only recognizable operations. For each letter x € S, let 
TN X be the set of representative words of traces whose terminal alphabet 
does not contain x. Remarking that Rep(L) is the representative set of the 
traces whose terminal alphabet contains no letter of B, one has 



Rep(L) 



fees 



(12) 



Hence, Rep(L) is recognizable if each TN^ is. But, one can easily verify 
that automaton Ab- 

T)z=ZZIIIIZZ=z(2) Com(6) 



A -{b} 

A\Com(b) 

recognizes TN&. Thus, we have the proposition 





Proposition 1 L is a recognizable submonoid o/M(S,i?). 



4 A deterministic automaton for a terminal con- 
dition 

One can compute a deterministic automaton recognizing L generalizing the 
construction of Ab- We consider an automaton Ab = {Sb, Ib, Fb,Tb) such 
that: 

1. The set Sb of its states is the set of all the sub-alphabets of B, 

2. There a unique initial state Ib = {0}, 

3. There a unique final state Fb = {0} = Ib, 

4. The transitions are 

T B = {(B', x, ((B' U {x}) n Com(x) D £))} B 'cB,*6E- 



5 



One has 



Proposition 2 The automaton Ab is a complete deterministic automaton 
recognizing Rep(L). 

Proof It is straightforward to see that such an automaton is complete and 
deterministic. Now, let us prove that it recognizes Rep(L). As Al is com- 
plete deterministic, for each word w = a± ■ ■ ■ a n we can consider a state s w 
which is the state of Ab after reading w. More precisely, we can define s w 
as s w = s n in the following chain of transitions 

(0, ai, si), (si, a 2 , s 2 ), • ■ ■ , (s n -i, a n , s n = s w ). (13) 

We first prove that if w is a word then s w = TA(t w ) n i? where t w denotes 
the trace admitting w as representative word. We use an induction process, 
considering as starting point: (0, x, {x} (1 B) where i£S, Let w = a\ - ■ ■ a n 
be a word of length n, such that s w is the intersection between B and the 
terminal alphabet of trace t w . Let a n+ \ G a be an other letter. One has, 
(s w ,a n+1 ,s wan+1 ) G T B . Hence, the set s„, an+ i is 

s W a n+1 = (s w U{a n+1 })f]Com{a n+1 }nB = TA(t w a n+1 )f]B = TA(t wan+1 )f]B. 

(14) 

This proves our assertion. Then, the set of words w such that s w — is 
exactly the set of representative words of L. [— ] 

Example 2 We consider the trace alphabet given by the following commu- 
tation graph 



e 




e d a b 

If we set B = {a, b}, then L is recognized by the following automaton (in 
the figure the only initial state and the only final state is 0): 
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a, c, d 




5 A deterministic automaton for the generating 
set of the left factor 

Each submonoid M of a trace monoid has an unique generating set which 
is the subset G(M) = M\M 2 . h 

In this section, we prove that G(L) is recognizable and we construct an 
automaton Ap which recognizes it. The automaton Ap is obtained from 
Ab by adding two states F,H, choosing F as final state instead of and 
modifying the transitions in such a way that if a letter of Z = A — B is read, 
the state reached belongs in F, H and the other states become unreachable. 

More precisely, one considers the automaton Ap = (Sp, Ip, Fp, Tp) ob- 
tained from the automaton As = (Sb, Ib, Fb,Tb) computed in the previous 
section as follows: 

1. The set of its states Sp, is the set of the sub-alphabets of B plus two 
states F and H, 

2. There is a unique initial state Ip = {0}, 

3. There is a unique final state Fp = {F}, 

4. The transitions are 

Tp = Tb^b U Tb^f U Tb^h U Tf^h U Th^h 

b The fact that G(M) generates M is straightforward and the unicity comes from that 
the Z-characteristic series of G(M) is the inverse of the Z-characteristic series of M in 
Z((A)). 
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where 



(a) T b ^b = {{B',b,B")} B i,B"cB,beB,(B>,b,B")eT B i 

(b) T B -+f = {{B 1 , z, F)}( B > ,z,W)&r B ,B'cB,B'^®,zeZi 

(c) T B ^H = {{B' , Z, H)}(b' ,z,B")£T B ,B' ,B"?{<l),F,H},zeZ, 

(d) T F ^ H = {(F,x,H)} xeS , 

(e) and T H ->h = {(H,x,H)} xe ^. 



Proposition 3 The automaton Ap recognizes Rep(G(L)). 

Proof The automaton is almost the same as As- As for As, if a word of B* 
is read, the automaton is in the state corresponding to its terminal alphabet. 
The difference appears when a letter of Z is read, if it is read from the 
state the automaton goes to the state F. Consider now a word w = w'z 
with w' € B + , z € Z. We denote 6 W the state of the automaton after 
reading w (this definition makes sense as, like Ab, Ap is deterministic). 
Now, if {z} = TA(w'z), then (S' w ,z, F) G Tb^f which means that w is 
recognized by Ap, otherwise (S' w ,z,H) £ Tb^h and w is not recognized 
by Ap. Furthermore, for each z € Z and 6 6 6, 5 w > zaw " = H (for each 
w',w" £ £*). This ends the proof. ^ 

Example 3 Consider again the example Then, (3 is recognized by the 
automaton 




b 
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6 Conclusion 



The factorisations of free monoids (or in a more general setting of a monoid 
constructed by generators and relations) is a relevant topic in the context 
of the theory of codes Lazard bisections, or more generally rational 
bisections [7j, play a role in the construction of bases of free Lie algebras 
[TT] and the study of circular codes ^ . A natural question asks if it is 
possible to generalize these properties to other monoids in particular when 
the free module over these monoids can be endowed with a shuffle coproduct 
[H] . The results contained in the paper consist in a step in the study of these 
problems for the trace monoids. The role played by the Lazard bisections 
in this context is not still completely known (see [3J[S| for some results). 
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